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1 .   Introduction. 

In  all  the  fire  distribution  problems  that  we  have  considered 
previously  [14]-[19]  an  implicit  assumption  has  always  been  that  fire  could 
be  instantaneously  shifted  from  one  target  type  to  another.   To  illustrate, 
let  us  recall  a  typical  problem: 


maximize  {ry(T)  -  px  (T)  -  qx  (T) }  , 
<j>(t)  L  L 

dxl 
subject  to:  -: —  =  -  $a  y, 

dx 

dT  =  -(l-*)a2y, 


dt1  =  "  blXl  "  b2X2'  (1) 


x  ,x  ,y  2:  0   and   0  £  <f>  £  1, 

and  with  initial  conditions 

X;L(t=0)  =  x°,   x2(t=0)  =  x°,   y(t=0)  =  yQ. 

In  this  problem  <j>   (the  fraction  of  the  Y-forces  which  fires  at  X  ) 
is  the  control  (decision  or  policy)  variable.   The  reader  should  note  that 
although  the  control  must  satisfy  the  condition  0  £  <|>  £  1 ,   the  rate  of 
change  of  4>   is  unrestricted  so  that  $      can  instantaneously  change,  for 
example,  from  0   to   1.   Physically,  this  means  that  we  are  assuming 
that  the  Y-forces  can  instantaneously  shift  fires  as  desired. 

When  one  considers  command  and  control  problems  in  combat,  the 
above  implicit  assumption  on  $   (instantaneous  jumps  permitted)  does  not 
seem  to  be  a  realistic  one.   A  better  assumption  appears  to  be  that 
there  is  a  limit  to  how  fast  $      can  be  changed.   For  example,  consider  a 


homogeneous  Y-force  in  combat  against  heterogeneous  enemy  forces.   A 
command  and  control  system  directs  the  fire  of  the  homogeneous  Y-force 
against  particular  enemy  targets.   The  effectiveness  of  the  command  and 
control  system  might  be  measured  in  terms  of  the  speed  and  accuracy  with 
which  units  of  the  Y-force  react  to  orders  as  to  which  type  of  enemy  unit 
at  which  to  fire  (see  [13]  for  some  similar  ideas) .   This  fire  distribution 
process  may  be  described  in  terms  of  a  distribution  of  fire  variable  <j> . 
We  have  thus  been  led  to  consider  target  selection  problems  in  which  the 
rate  of  change  of  the  allocation  variable  is  bounded  (i.e.  instantaneous 
shifts  in  fire  are  not  allowed) .   For  reasons  discussed  below,  we  have 
chosen  to  call  such  a  situation  "inertial  combat." 

Although  problems  in  which  curves  are  restricted  to  lie  in  a  given 
domain  were  considered  in  the  classical  calculus  of  variations  as  long  ago 
as  1831  [5]  (see  also  [1])  and  discussed  by  Weierstrass  in  his  lectures  of 
1879  (see  p.  395  of  [1]),  development  of  optimality  conditions  for  optimal 
control  problems  with  state  variable  inequality  constraints  has  been 
accomplished  only  comparatively  recently.   As  the  author  pointed  out  in 
[19],  state  variable  inequality  constraints  (SVIC's)  are  present  in  all 
Lanchester-type  optimal  control/differential  game  problems.   Recent  activity 
in  developing  necessary  conditions  of  optimality  for  problems  with  SVIC's 
apparently  owes  its  origin  to  the  work  of  Gamkrelidze  (for  an  English 
translation  of  his  original  work  see  Chapter  VI  of  [11]).   Gamkrelidze 
points  out  that  in  many  physical  problems  there  are  restrictions  not  only 
on  the  control  parameters  but  also  on  the  state  (phase)  space.   He  (see 
p.  263  of  [11])  refers  to  piecewise  continuous  controls  as  "inertialess 
controls,"  since  such  controls  can,  if  need  by,  instantaneously  jump  from 


one  value  to  another.   Following  Gamkrelidze  then,  we  use  the  term  inertial 
combat  to  refer  to  a  target  selection  problem  in  which  the  rate  of  change 
of  the  distribution  of  fire  is  bounded. 

In  [17]  the  author  first  applied  the  theory  of  SVIC ' s  (the  approach 
of  Gamkrelidze  [11])  to  an  allocation  problem  in  the  Lanchester  theory  of 
combat.   In  this  first  application  little  more  than  developing  optimality 
conditions  on  constrained  subarcs  was  done.   In  [19]  we  introduced  a  more 
convenient  approach  to  first  order  SVIC's  (the  approach  of  Speyer  (see  [6])) 
and  used  theory  (including  corner  conditions  and  boundary  conditions  for 
the  adjoint  variables  (see  also  [21]))  to  completely  solve  a  problem 
similar  to  (1).   The  paper  at  hand  further  extends  such  results:   we  con- 
sider  a  problem  with  a  second  order  SVIC  (as  well  as  first  order  SVIC  s) . 
This  application  is  possible  because  of  theoretical  results  recently  obtained 
by  the  author  who  extended  Gamkrelidze 's  multiplier  condition  [11]  (see  also 
[20])  to  a  p —  order  SVIC  [22]  (see  also  [21]).   The  reader  can  find  a 
further  discussion  of  the  theory  of  SVIC's  in  [17]  and  [19]  (see  also  [6] 
and  [10]). 

This  paper  is  organized  in  the  following  fashion.   First,  we  discuss 
the  optimal  control  problem  (optimal  fire  distribution  in  the  presence  of 
command  and  control  limitations).   Then,  the  basic  necessary  conditions 
of  optimality  are  developed  for  the  problem.   Next,  the  synthesis  of  the 
extremal  fire  distribution  policy  is  outlined  in  several  cases.   The 
determination  of  the  optimal  fire  distribution  policy  is  discussed,  and 


This  terminology  was  apparently  first  coined  by  Bryson,  Denham,  and  Dreyfus 
[3]  .   They  say  that  a  problem  has  a  pill  order  SVIC  when  the  pill  time 
(total)  derivative  of  the  state-variable  constraint  is  the  first  derivative 
to  explicitly  contain  the  control  variables. 


a  more  general  model  considered.  Finally,  we  discuss  the  insights  gained 
into  the  optimization  of  combat  dynamics  from  our  study  of  the  problem  at 
hand. 

2 .   The  Fire  Distribution  Problem  in  the  Presence  of  Command  and  Control 
Limitations . 

Accordingly,  we  consider  the  following  problem: 


maximize  {ry(T)  -  px. (T)  -  qx  (T) }  with  T   specified, 


u(t) 


dxl 
subject  to:   ^—  =  -(fra^y 

dx 


"  =  -(l-4>)a0y, 


dt     v  T'  2' 

d?  =  "  blXl  "  b2X2' 

ft     -   «.  (2) 

x  ,x_,y  ;>  0,  T  <:  T  ,  Oif^l,  and  -  II  s:  u  s:  EL., 

and  with  initial  conditions 

Xl(t=0)  =  x°,   x2(t=0)  =  x°,   y(t=0)  =  yQ,   <t>(t=0)  =  4>Q, 

where  all  symbols  are  defined  in  the  next  section  and  IL  ,R.T  >  0.   It 
should  be  noted  in  the  above  model  it  is  no  longer  possible  for  4"   to 
instantaneously  change  from,  for  example,   0   to   1  as  it  had  been  for 
(1) .  As  we  discussed  in  the  introduction,  this  is  how  we  incorporate 
command  and  control  limitations  into  our  model. 

We  will  focus  primarily  upon  the  development  of  the  basic  necessary 
conditions  of  optimality  for  (2)  and  the  synthesis  of  extremal  control  from 


these.   As  discussed  more  thoroughly  in  Section  8,  it  is  not  practical 
for  computational  reasons  to  completely  carry  out  the  determination  of 
the  optimal  control.   Thus,  for  this  more  limited  goal  of  characterizing 
the  optimal  fire  distribution  policy  the  nature  of  the  planning  horizon 
(terminal  target  set  or  prescribed  duration)  doesn't  make  any  difference, 
and  we  will  be  purposely  vague  on  this  point. 

The  reader  should  note  that  the  control  variable  in  problem  (2)  is 
u,  while  <£   (the  control  variable  in  problem  (1))  is  now  a  state  variable 
Hence,  the  restriction  0  £  $  £  1   is  now  equivalent  to  two  (first  order) 
SVIC's.   When  we  use  the  approach  of  Gamkrelidze  (see  Chapter  VI  of  [11]) 
(as  modified  by  Bryson  et  al.  [3]  (see  also  [6])),  a  SVIC  such  as 
C(t,x.(t))  £  0   is  replaced  by  the  point  constraint 

C(t   fc   ,x.)  =  0, 
entry'  l 

and  the  control  inequality  constraint  on  the  state  boundary   (C=0) 

AC 

—  (t,x.,u)  <;  0  for  t  €  [t  .   ,t   ,  1. 
at     i  entry  exit 

Thus,  for  <j>  -  1  ^  0 ,   we  treat  boundary  arcs  when  <j>  =  1  by  considering 

cb(t     )  =  1   and  then  requiring 
Y   entry 

4^-=u<;0   for   t  €  [t     ,t   .]   when  <j>  -  1  =  0,       (3) 
dt  entry  exit         y 

and  for  -<j>  £  0,   we  treat  boundary  arcs  when  <J>  =  0  by  considering 

d>(t     )  =  0  and  then  requiring 
*   entry  H     & 

-4^=-u^0   for   t€[t     ,t   .]   when  -  <j>  =  0 .     (4) 
dt  entry  exit 


However,  see  [14]-[18]  for  the  type  of  considerations  (i.e.  enumeration 

of  all  possible  terminal  states)  required  for  developing  a  complete  solution 

to  such  a  problem. 


To  avoid  being  encumbered  by  too  many  symbols,  we  will  consider 
only  one  of  the  two  SVIC's  x..  ,x~  ^  0.   Clearly,  we  lose  no  generality  in 
considering  x  ^  0.   In  this  case,  we  have 

C(t,x±)  -  -  xx  as  0,  (5) 

ac  dxi 

£<'•*!>-- aT  =  *V-  (6) 

d2C 

j^Z   (t,x±,u)  =  uaxy  -  ^(b^-j+b^)  .  (7) 

On  a  constrainted  subarc  on  which  x.,  (t)  =  0   for   t      si  t  si  t   .  , 

1  entry        exit 

the  SVIC  is  replaced  by  the  point  constraints 


and 


C(t     ,x.)  =  -  xn (t     )  =  0,  (8) 

entry'  V  lv  entry     ' 

ac  dxi 

^  (t      ,x.)  -  -  t—  (t     )  =  <Kt     )a,y  =  0,       (9) 

dt   entry'  i      dt    entry    TV  entry  V 

and  the  control  inequality  constraint 

0  (t,x.,u)  -  uV  -  ♦a1(b1x1+b2x2)  S  0  for  t  €  [t^.t^l.   (10) 


Thus,   x  :>  0   is  a  second  order  SVIC.   Clearly,  when  x  (t)  =0  for  a 
finite  interval  of  time,  by  (6)  we  must  have  4>*(t)  =  0   (since  y  >  0) 
and  then  (7)  yields  u*(t)  =  0,   where  we  have  considered  the  state 
equations  (2) . 

3.   Notation. 

The  symbols  which  are  used  in  this  paper  are  defined  as  follows: 

a1,a_,b1,b_  =  constant  attrition-rate  coefficients, 


<«><».>-& 


t=t 
e 


F(<}>,u)  =  rate  of  change  of  fraction  of  Y-fire  directed  at  X   in  more 
general  redistribution  of  fire  model, 

H  =  Hamiltonian  function, 

J  =  criterion  (return)  functional  =  ry(T)  -  px  (T)  -  qx2(T), 

p,q,r  =  utilities  assigned  to  surviving  X  ,   X   and  Y   forces 
respectively, 

Pj(t)   for   i  =  1,2,3,4  =  dual  variable  corresponding  to  x.(t) 

(x3(t)  =  y(t),   x4(t)  =  <J)(t)),    X 

1L  ,R  =  lower  and  upper  bounds  on  magnitude  of  rate  of  change  of 
(i.e.   -^  <L   u  <;  Ry), 

t  =  time  after  beginning  of  battle, 

t  =  time  of  entry  to  constrained  subarc, 
e 

t-  =  lim  t, 

e   t-t 
e 

t£t 
e 

o  o        o 

t  =  time  of  entry  to  constrained  subarc  with  4>(t)  =  0   for   t  £  t  s:  t. 

e    ,  ,  ..   .   ,     K  el 
(similarly  for   t  ) , 

t.  =  time  of  leaving  constrained  subarc, 

t  ■  time  at  which  u*(t)   switches  from  R.   to   -R.   with  0  <  $   <  1 ; 
S   defined  by  p  (t=t  )  =0, 

t,  =  time  at  which  X   is  annihilated,  i.e.   x  (t  )  =  0, 

T  =  time  at  which  battle  ends, 

T  =  maximum  possible  duration  for  battle,  i.e.   T  i  T  , 

u  =  control  variable  for  redistribution  of  fire, 

v  -  v(t)  =  a1(-p1(t))  -  a2(-p2(t)), 


W  =  Bellman's  optimal  value  function, 

i 
x-,x.,y  ■  combatant  force  levels;  with  initial  values  x  ,x  ,y  , 

5,6- ,6.  =  positive  constants, 

H-,  (t)  (n9(t))  =  multiplier  corresponding  to  state  variable  inequality 
constraint  4>  £  1   (<f>^0) , 

u(t)  =  multiplier  corresponding  to  state  variable  inequality  constraint 
Xx  5S  0, 

v  =  multiplier  corresponding  to  intermediate  equality  constraint  x  (t  )  =  0 
used  to  (formally)  handle  entry  to  a  constrained  subarc  with 
xL(t)  =  0. 

v.   for   i  =  1,2,3  =  multiplier  corresponding  to  state  variable  terminal 

inequality  constraint  x.(T)  ^  0   (x  (T)  =  y(T)), 

v  (v  )  =  multiplier  corresponding  to  state  variable  terminal  inequality 
constraint  <|>(T)  £  1   (<I>(T)  ^  0), 

<j)  =  fraction  of  Y-f ire  directed  at  X..  , 

t  =  "backwards  time"  from  the  end  of  the  battle;  defined  by  t  =  T  -  t, 
i.e.  the  time  remaining  before  the  end  of  battle, 

t  =  "backwards  time"  of  the  first  change  in  the  sign  of  the  switching 
function  v,   i.e.   v(t=T-x  )  ■  0. 

4.   Characterization  of  an  Optimal  Fire  Distribution  Policy. 

Using  Gamkrelidze's  approach  and  considering  (3),  (4),  (8),  (9), 
and  (10),  we  have  that  the  Hamiltonian  is  given  by  [2],  [6],  [11] 

H(t,xi,pi,u)  =  -P1t()a1y  -  p2(l-<t>)a2y  -  p^b^+b^)  +  p4u 

-  n1(t)u  +  n2(t)u  -  u(t){ua1y-<j)a1(b1x1+b2x2)},     (11) 

where 


,  =  0   for   <j>  <  1,  =  0   for   <J>  >  0 , 

1,(0  {  n2(t)  { 

1  ^  0  for  |  =  1,  {    >   0  for  <J>  =  0, 


and 


=  0   for  x  >  0, 

y(t)  {  x 

'  £  0   for  x  =  0. 

We  have  adopted  above  the  following  correspondence  between  state  and 
dual  variables: 

state  variable         dual  variable 


xi  pr 

x2  p2, 

y  p3, 


* 


V 


Again,  to  avoid  being  encumbered  with  too  many  symbols,  we  have  only 
considered  one  (i.e.   x  ^  0)  of  the  two  SVIC's  x  ,x  ^  0.   The  other 
SVIC  (i.e.   x  ^  0)  is  handled  in  a  similar  way.   The  adjoint  system  of 
differential  equations  for  the  dual  variables  is 


-  blP3  -  y(t)(f.a1b1,  (12) 


=  b2p3  -  u(t)<})a1b2>  (13) 


=  c()a1p1  +  (l-cfr)a2p2  +u(t)aLu*,  (14) 


=  (a1P1~a  p2)y  -  y (t)a1(b1x1+b2x2) .  (15) 

The  boundary  conditions  at   t  =  T   for  the  adjoint  (or  dual)  variables 

may  be  written 

P;L(t=T)  =  -p  +  V1,   p2(t=T)  =  -q  +  v2,   p3(t=T)  =  r  +  v3,   P4(t=T)  =  ^  -  \>5> 


dp, 

dt 

3H 

8xi 

dP2 

3H 

dt 

3x2 

dp, 
dt 

3H 

3y  ' 

dp4 

3H 

dt 

3d> 

10 


where  v.   for   i  =  1,...,5  are  undetermined  multipliers  which  require 
the  specification  of  additional  information  to  be  further  delineated.   To 
this  end,   let  us  consider  the  case  in  which  Y   loses  with  T  <  T    (i.e. 
T  defined  by  y(T)  =  0).   From  the  transversality  condition  H(T,x  (T), 
p  (T),u*(T))  =0,   we  obtain  p~(T)  =  0.   Then,  we  have  [2],  [21] 

p,(T)  =  -p  +  vL,   P2(T)  =  -q  +  v2,   p3(T)  -  0,   p4(T)  -  v4  -  v5,    (16) 

where 

=  0   for  x4(T)  >  0, 

(T)  =  0  but  x. (t)  >  0  for   t  <  T, 


j 

:or  i  =  1,2   v.  <  i  0  for  x,  I 


unrestricted  when  x.(t)  =0   for   t.  £  t  £  T 

l  l 

with  t.  <  T, 
i 


and 

=  0   for  <fr(T)  >   0,  =  0  for  <J>(T)  <  1, 


*  V  ^  n   fnr   AfTl  =  n  ^  \ 


Z   0  for  KT)  =0,  l  k  0  for  <f>(T)  =  1 


When  x..,x,-,y  >  0  and  0  <  <j>  <  1,   the  (extremal)  control  law  is 
determined  by  the  maximum  principle.   Hence,  we  consider 


and  this  yields 


maximize  H(t,x  ,p  ,u), 


R   for  p  (t)  >  0, 
u*(t)  =  {  4 

'  -R   for  p,(t)  <  0. 


(17) 


* 
Other  cases  are  handled  in  a  similar  manner.   See  [19]  for  a  problem  in 

which  the  boundary  conditions  for  the  adjoint  variables  are  worked  out  for 

all  the  battle's  end  states. 
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We  must  further  investigate  the  possibility  of  singular  subarcs  [7]  (also 
see  Chapter  8  in  [2])  on  which  —  -  0   for  a  finite  interval  of  time  (so 

oU 

that  all  its  time  derivatives  vanish) .   The  condition  that  -r—  =  0  yields 

3u 

that  on  a  singular  subarc  we  must  have 


P4(t)  =  0.  (18) 


The  condition  —  [— )    =   0   then  yields 
dt  ^3u^ 


a1P]_(t)  =  a2p2(t).  (19) 

Proceeding   to    the  next   time  derivative,   we  would  have  on  a   singular   subarc 
on  which    (18)    and    (19)    hold   that 

j£(f$    =yp3(a1b1-a2b2).  (20) 

From  (20),  we  see  that  a  singular  solution  is  impossible,  since  it  is 

j2   .-SVK 

impossible  (in  general)  to  have  -7-3-  h— J  =  0   for  a  finite  interval  of 
time. 

4.1.   Necessary  Conditions  of  Optimality  on  Constrained  Subarc  for  $_. 

On  a  constrained  subarc  on  which  <J>(t)  =  1   for   t  ^  t  £  t   the 

d<t> 
control  is  determined  by  —  =  0  and  hence 

dt 

u*(t)  =0   for   t  <  t  <  t  .  (21) 

3H 
The  multiplier   n,  (t)   is  determined  by  the  condition  - —  =  0  and  hence 
1  3u 

nx(t)  =  p4(t).  (22) 

The  requirement  that   ru  (t)  ^0  yields  that  on  the  constrained  subarc  we 
mus  t  have 
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P4(t)  :>  0.  (23) 


Differentiating  (23)  and  combining  with  (15)  ,  we  obtain 


n1(t)  =  y(a1p1-a2p2) ,  (24) 

so  that  Gamkrelidze's  condition  n,  (t)  £  0   [11]  (see  also  [20])  is  only 
satisfied  on  a  constrained  subarc  with  <J>  =  1  when 


a^-p^t))  ;>  a2(-p(t)),  (25) 

which  the  reader  will,  of  course,  recognize  as  a  result  for  the  correspond- 
ing "inertialess"  combat  problem  (see  [14],  [19]).   Denoting  the  time  of 

an  entrance  corner  by  t   and  that  of  an  exit  corner  by   t. ,   the  corner 

e  I 

conditions  (see  [17]  and  [19]  for  further  discussion,  especially  for 
corners  interior  to  the  state  space  (which  are  not  explicitly  discussed 
here))  yield  that  at  an  entrance  corner  we  have  [10] 

Pi(te}    =  Pi(te"}      f°r      i  =  l'2>3'  (26) 


and 


or 


P4(V    -°.-P4<<)    -V#.  «7) 


p4(0  -vO-  <28> 

where   t   denotes  a  left-hand  limit.   The  reader  should  note  that  (28) 
e 

is  in  consonance  with  (22).   Furthermore,  at  an  exit  corner  we  have 

Pi(V  =  Pi^j)   f°r   i  =  1'2'3'4"  (29) 

Considering  either  n-,(t  )  =  0  or  H(t~)  =  H(t  )  ,   we  find  that 
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P4(V    =  °   =   p4(t£'  (30) 

Considering    (24),    (28),   and  again     n,  (t   )    =0,     we  see   that  when   there 
is  an  exit   from  the  constrained   subarc  at      t  =   t    ,      then    (28)    yields 

+  +  }l 

P4(te)   =  n1(te)   =    ,      y(t){a2p2(t)    -  a^UHdt  £  0.  (31) 

t 
e 

On  a  constrained  subarc  on  which  $(t)  =0   for   t..  £  t  £  t~   the 

details  are  similar  to  the  above  with  the  control  again  given  by  (21) , 

dd> 
since  again  -rr  =  0.   Determinations  similar  to  the  above  yield 

n2(t)  =  -P4(t),  (32) 

and  that  it  is  necessary  on  the  constrained  subarc  that 


P4(t)  £  0,  (33) 


and 


a^-p^t))  £  a2(-p2(t)),  (34) 

the  latter  condition  (34)  being  a  consequence  of  Gamkrelidze ' s  multiplier 
condition  n9(t)  ^  0.   Corner  conditions  similar  to  (26)  through  (30)  also 
must  hold.   When  there  is  an  exit  from  the  constrained  subarc  at   t  =  t  , 
we  find  that 


fi 


t 

e 


y(t){a2p2(t)  -  alPl(t)}dt  £  0.       (35) 
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4.2.   Necessary  Conditions  of  Optimality  on  Constrained  Subarc  for  x  . 

On  a  constrained  subarc  on  which  x.  (t)  =  0  for   t  £  t  £.   T   the 

d2xl 
control  is  determined  by     g  =  0  which  (along  with  the  requirement  that 

dxl 

-7—  =  0)   yields 

dt 


and 


u*(t)  =0   for   t  <  t  $  I,  (36) 


(t)  =0  for  t  <:  t  £  T.  (37) 


8H 
The  multiplier  y(t)   is  determined  by  the  condition  -r—   =  0  and  hence 

oU 

p4(t)  =  y(t)axy  =  0,  (38) 

or 

P4(t) 
U(t)  =  -±—   .  (39) 

aly 

Differentiating  (38)  with  respect  to  time  and  combining  with  (15) ,  we 

obtain 

y(t)  =  ±   (a^-a^).  (40) 

Further  differentiation  and  combination  with  (12) ,  (13) ,  and  the  condition 

$(t)  =0  yields  that 

P-(t) 

y(t)  =  -f (a.b  -a,b,).  (41) 

a^      112  2 

The  necessary  condition  of  optimality  on  a  constrained  subarc  of  a  second 

k  dk 
order  SVIC  [22]  is  that   (-1)  — £  k  0   for  k  =  0,1,2,   and  hence  consid- 

dt 

ering  (39) ,  (40)  ,  and  (41)  we  must  have 


* 

This  result  was  obtained  after  cancellation  of  a  term  y(t) .   Hence,  a 

different  argument  is  required  when  y(t)  =  0.   This  latter  condition, 
however,  only  occurs  at  most  at  a  single  isolated  point   t  =  T.   See  [19] 
for  a  similar  occurrence  and  further  discussion. 
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P4(t)  *  0,  (42) 

a^-p^t))  ;>  a2(-p2(t)),  (43) 

and  a  b  ^  a2b?'  ^^ 

since  it  is  readily  shown  that  p~(t)  >  0  for   t  <  T.   It  is  of  interest 
to  note  that  (43)  and  (44)  have  previously  been  shown  [17]  to  be  necessary 
conditions  of  optimality  for  having  x  =  0   for  a  finite  interval  of  time 
in  the  inertialless  combat  problem  (1) . 

As  in  [14],  [19]  let  us  make  the  nonrestrictive  assumption  that 
a..b  >  a  b  .   This  then  implies  that  it  is  non-optimal  to  have  x  =  0 
for  a  finite  interval  of  time,  since  (44)  must  hold  with  the  sense  of  the 
inequality  reversed  on  such  a  constrained  subarc. 

For  a  second  order  SVIC   C(t,x  )  £  0,   we  must  have  at  an  entrance 
corner  to  a  constrained  subarc  [10] 

P^V-p^t*)  -V0f£-Cte)  -«(0^««<te))   f«   1=1 n,        (45) 

and 

H(t")  =  H(t+)  +  v  |£  (t  )  +  \i(t~)   j-   {(C)(t  )},         (46) 
e       e     Jdte       edt       e 

where  [6]   v  ^  0.   Recalling  (5)  and  (6),  we  find  for  the  problem  at  hand 

at  an  entrance  at   t  =  t   to  a  constrained  subarc  on  which  x.  =  0   that 

e  1 


the   following   hold 


Pl(tJ  =   Pl(t+)   +  V  (47) 

p,(t")  -   p.(t+)      for     i  =   2,3,  (48) 

1      e  l      e 

p4(te}  =   P4(te}    "  ^Ov*  (49) 
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From  (38)  we  have 


so  that  (49)  yields 


P4(t+)  =  y(t+)aiy,  (50) 


p4(te)  =  0.  (51) 

Furthermore,  the  corner  condition  (46)  (which  reads  H(t  )  =  H(t  ))   is 
satisfied  when  (48)  and  (49)  hold.   It  should  be  clear  that  no  trajectory 
can  leave  a  constrained  subarc  on  which  x..  =  0  after  entry.   Hence,  we 
omit  discussion  of  the  corner  conditions  at  exit  corners. 

It  will  now  be  shown  that  we  must  have  \>     >   0.   Considering  the 
nonrestrictive  assumption  a->b  >  a9^9>   it  should  be  clear  that  we  must 
have  x  ,x_,y,<j>  >  0   for  0  <  t  <  t    (we  might  have  <}>(t=0)  =  0).   Then 
c|>(t)  >  0   for  0  <  t  <  t   and  <$>(t)    =0  for   t  £  t  with  <J>(t)   being 
continuous  imply  that  u*(t  )  =  -JL  ,   and  thus  by  (17)  and  (51) 

p.(t)  <  0  for  t  6  (t  -6,t  )  with  p. (t~)  =  0,         (52) 
4  e    e  h      e 

where  6  >  0   is  a  suitably  chosen  constant,  and  we  have  used  the  fact 

that  it  is  impossible  for  P/(t)  =  0   for  a  finite  interval  of  time  (this 

was  established  when  we  showed  the  impossibility  of  a  singular  solution) . 

Expanding  p,  (t)   for   t  <  t   in  a  Taylor  series  about   t  =  t  ,   we  find 

that 

(t  -t)  -rf-   (t)  =  (-p.(t))  >  0,  (53) 

e    at  4 

dp4 

where  £  £  (t,t  ).   Hence,  -: —  (t)  >  0   for  all   t  6  (t  -6,  ,t  )   where 
e  at  e  I  e 

0  <  <S  :£  <5 .   Recalling  (15)  and  that  y  >  0,   we  have 

anpn(t)  >  a  p  (t)   for   t  6  (t  -5,  ,t  ),  (54) 

II       11  e  I  e 
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so  that  in  the  limit  we  have 


alPl(te}  *  ^l^'e*'  (55) 


Next,  we  show  that  we  must  have 


alPl(te}  >  a2P2(te)#  (56) 

The  proof  is  by  contradiction.   Considering  (55),  we  assume  that  a  p  (t  ) 

=  a  p„(t  ).   Again  expand  P/(t)   for   t  <  t   in  a  Taylor  series  about 

t  =  t  .   Recalling  (15)  and  using  the  above  assumption  and  (51)  ,  we  find 

that  for   t  <  t 
e 

(t  -t)2  d2p, 

.P4(t)  -— ^ — dt^(E)'  (57) 

where  t  6  (t,t  ).   Using  the  state  and  adjoint  equations,  we  readily 
compute  that 

d2p4 

-^-   (t)  =  yp3(t)(a1b1-a2b2)  -  {a^O-a^  (t) }  (b^+b^)  .   (58) 

By  the  continuity  of  the  dual  variables  between  corners,   t   can  be  chosen 

€(t  -<S,t  )   such  that  for  all   t  6  (t.t  )   we  have 
e    e  e 

and  hence  by  (57)  we  have  a  contradiction  to  (52).   Thus,  (56)  must  hold. 
Next,  we  show  that  we  must  have 

WO  *   a2P2(tt}'  (59) 

This  follows  immediately  from  p-(t)  >  0   for   t  <  T  =»  y(t)  >  0   for 

t  £  t  <  T.   Then  u(t=T)  £  0  =*  y(t)  <  0  for   t   <  t  <  T,   and  reference 
e  e 

to  (40)  yields  the  desired  result  (59) . 
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The  proof  that  v  >  0  now  readily  follows.   First,  we  observe 
that  (48),  (56),  and  (59)  yield 

Vl^?  "  a2P2(te}  =  a2P2(tt}  >  Vl^e^  (60) 

or 

Pl(te}  >  Pl(t0'  (61) 

whence  follows  v.  >  0  by  (47).   Moreover,   v   is  chosen  so  that  <j>  =  0 
precisely  when  x  (t  )  =  0. 

5 .   Synthesis  of  Extremal  Policy  when  x  (T) ,x?(T)  >  0. 

In  this  and  the  next  two  sections  we  synthesize  the  extremal  fire 
distribution  policy  for  all  cases  in  which  Y  loses  with  T  <  T     (the 
same  case  for  which  the  boundary  conditions  for  the  adjoint  variables  were 
given  in  Section  4) .   By  the  synthesis  of  the  extremal  control  we  mean  the 
explicit  determination  (using  the  necessary  conditions  of  optimality)  of 
the  time  history  of  the  extremal  control  from  initial  to  terminal  time 
(see  [17]-[19]  for  further,  more  extensive  discussion). 

The  basic  idea  is  to  trace  extremals    backwards  from  a  given 
terminal  state  in  such  a  way  as  to  guarantee  the  satisfaction  of  the  initial 


This  multiplier  Vq  arises  because  the  system  loses  two  degrees  of  freedom 
when  it  enters  the  constrained  subarc  (see  pp.  411-412  of  [10]).   For  a  first 
order  SVIC,  the  value  of  the  multiplier  y(t£)   at  the  entrance  corner  accounts 
for  the  loss  of  one  degree  of  freedom  by  the  system  upon  entering  the  con- 
strained subarc.   For  a  second  order  SVIC,  there  are  two  degrees  of  freedom 
lost  this  way  (for  the  problem  at  hand,   x^  =  0  and  <j>  =  0  on  the  constrained 


** 


subarc),  one  of  which  is  accounted  for  by  y(t'T) 


Other  cases  are  handled  in  a  similar  manner.   See  [18]  and  [19]  for  problems 
in  which  this  is  done  for  every  end  state  of  battle. 

*** 

By  an  extremal  we  mean  a  path  (or  trajectory)  on  which  the  necessary  condi- 
tions of  optimality  are  satisfied  at  every  point  in  time. 
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conditions.   Thus,  it  is  convenient  to  introduce  the  "backwards  time" 

ariable 

d*    d2 
dt2"  "  dx7" 


variable  x   defined  by  t  =  T  -  t.   We  observe  that  -—   =  -  —  but 

dt     dx 

It  is  also  convenient  to  define 


v(t)  =  a^-p^t))  -  a2(-p2(t)),  (62) 

so  that  differentiation  and  combination  with  (12)  and  (13)  yield 

dv 

—  =  -p3(t)(a1b1-a2b2).  (63) 

Then  our  nonrestrictive  assumption  that  a  b   >  a  b   yields  that 

4?  (t)  <  0   for  all   t  <  T,  (64) 

at 

since  it  is  easily  shown  that  p~(t)  >  0  for   t  <  T.   Using  (62),  it  is 
convenient  to  write  (15)  (for  x  >  0)   as 

dp4 

dT--yv,  (65) 

and  hence 

d2p4  dv 

Zr7~  =  (biVb2x2)v  '  y  dt"  (66; 

In  synthesizing  an  extremal  there  are  two  cases  to  consider: 


Case  (a)   a  p  ^  a  q, 


Case  (b)   a  p  <  a  q. 

For  Case  (a):   a,  p  ^  a„q,   it  is  convenient  to  first  observe  that 
a  Taylor  series  expansion  of   p,  (x)   about   x  =  0  yields  for   x  ^  0 

dp,  2  d2P< 

P4(x)  =  p4(x=0)  +  x  jf-   (x=0)  +  ±-  -^  (t=T)  ,  (67) 

where  x  6  (0,x).   In  this  case  we  have 


20 


v(t=0)  =  aLp  -  a2q  ^  0,  (68) 

so  that  considering  (64)  it  is  readily  seen  that 

v(t)  >  0   for  t  >  0,  (69) 

and  hence  (65)  and  (66)  yield 

■—■   (x=0)  =  y(T)(alP-a2q)  =  0,  (70) 

since  y(T)  =  0  and 

d*p 

-r-r1  (t)  >  0  for  x  >  0.  (71) 

ax 

Furthermore,  there  are  three  subcases  to  be  considered  when 

alP  *  a2q: 

Subcase  (al)   <j)(t=T)  =  0, 

Subcase  (a2)   0  <  <f>(t=T)  <  1, 

Subcase  (a3)   <<)(t=T)  =  1. 

We  will  now  show  that  Subcase  (al)  is  not  consistent  with  an  optimal  policy 
and  work  out  details  for  the  other  two  cases. 

Subcase  (al)  :   <}>(t=T)  =  0  when  a.,  p  ^  a?q. 

Since  $(t=T)  =  0,   (16)  yields  that  p,(x=0)  =  v  ;>  0.   Then  (67), 
(70) ,  and  (71)  yield  that 

P4(t)  >  0   for  t  >  0.  (72) 

If  the  system  would  be  on  a  constrained  subarc  for  a  finite  interval  of 
time,  i.e.  <f>(t)  =  0  for  t  £  t  £  T,  then  (72)  is  a  violation  of  the 
necessary  condition  of  optimality  (33).   (We  also  note  that  Gamkrelidze's 
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necessary  condition  n2(t)  £  0   is  violated,  since  by  (32),  (65)  and  (69) 
we  have  n  (t)  ■  yv(t)  >  0   for   t  £  t  <  T.)   If  we  were  not  on  a  con- 
strained subarc  for  a  finite  interval  of  time,  then  we  must  have  <J>(x)  >   0 
for  0  <  t  <  5  with  4>(t=0)  =  0.   This  implies  that   u*(t)  =  -FL   for 
0  £,  x   <   5  £  5,   and  this  is  impossible  by  (17)  and  (72).   Hence  this 
case  is  inconsistent  with  an  optimal  policy. 

Subcase  (a2)  :   0  <  <fr(t=T)  <  1  when  a  p  :>  a  q . 

Since  0  <  <|>(t=T)  <  1,   (16)  yields  p,(t=0)  =  0,   and  (72)  again 
follows.   Then  by  (17)  and  (72)  we  have 

u*(t)  =  R^   for  0  £  x  £  T.  (73) 

Denoting  <}>(t=0)   by  #_,   we  then  have  $  +  K.T   =  <j>(t=T)  <  1»   so  that 
this  case  happens  when  T  <  (1-tfO/IL  .   The  extremal  policy  is  then  given 
by 

u*(t)  =  R^   for   0  £  t  £  T  <  (l-^/Ry.  (74) 

For  longer  times  we  must  go  to  the  next  subcase. 

Subcase  (a3)  :   <f>(t=T)  =  1  when  a  p  ^  a  q. 

Since  (J>(t«T)  =  1,   (16)  now  yields   p  (x=0)  =  -v  £  0.   It  may  be 
shown  that  a  contradiction  arises  unless  \>      =  0.   Hence,  we  must  have 
p,  (t=0)  =  0.   If   <j»(t)  <  1   for  T  -  6  <;  t  <  T  where   6  >  0,   then  the 
development  of  the  previous  subcase  holds.   If  we  are  on  a  constrained 
subarc  with  <j>(t)  =  1   for   t  £  t  £  T ,   then  by  (24)  and  (62)  we  have 
n. (t)  =  -yv(t) .   Hence  Gamkrelidze's  necessary  condition   n7  (t)  ^  0   is 
satisfied  by  (69).   Thus,  we  can  remain  (in  backwards  progression  at  the 
end   t  =  T)   on  the  constrained  subarc  until  we  have  to  get  off  to  meet 
the  initial  condition  4>(t=0)  =  $«•   A.s  we  work  backwards  and  leave  the 
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constrained  subarc  (but  in  forwards  time  enter  at   t  ) ,   the  corner 

e 

condition  (27)  and  a  Taylor  series  expansion  of  p  (t)   about   t  =  t 

yield  for  0  £  t  &  t 

e 

dp,  (t  -t)2  d2p, 

P4(t)  "  -(Vfc)  IT  <«.>  +  -\~  dx^  (t=r)'  (75) 

where  t   £   (t,t  ).   Recalling  (65),  (69),  and  (71),  we  see  that  P,(t)  :>  0 
for  0  £  t  £  t   so  that  the  rest  of  the  analysis  is  similar  to  that  of 
the  preceding  subcase.   Hence 

when  T  ^  (l-<j>  )/lL  ,   we  have 

for   0  £  t  £  (l-^/Ry,   u*(t)  =  Ry   and  4>*(t)  =  <J>Q  +  B^t, 

for   (l-^0)/Ru  <  t  £  T,   u*(t)  =  0   and  <fr*(t)  -  1.  (76) 

For  Case  (b) :   a.p  <  a?q,   we  have  that  (recalling  (62)) 

v(x=0)  =  axp  -  a2q  <  0.  (77) 

From  (64)  we  have  -r-   (x)  >  0   for   x  >  0  with  v(x)   a  continuous 

dx 

function  (see  [2]  and  above  corner  conditions  at  boundary  of  state  space 
(26)),  and  thus  at  some  (backwards)  time  v(x)   must  become  zero.   Denote 
this  "backwards  time"  as   t.  .   Thus  v(x=x..)  =  0.   There  are,  again, 
three  subcases  to  be  considered  when  a  p  <  a  q: 

Subcase  (bl)   <J>(t=T)  =  0. 

Subcase  (b2)   0  <  4>(t=T)  <  1, 

Subcase  (b3)   4>(t=T)  =  1. 

Analysis  of  these  subcases  is  similar  to  that  given  for  Case  (a)  with 
Subcase  (b3)  being  impossible. 
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Let  us  now  observe  (recalling  (25)  and  (34))  that  in  order  to 
satisfy  Gamkrelidze's  condition  on  constrained  subarcs  with  <j)  =  0   or 
4>  *  1,   we  must  have 

v(t)  ^  0  when  <£(t)  =  1   for  a  finite  interval  of  time,      (78) 

and      v(t)  £  0  when  4>(t)  =  0   for  a  finite  interval  of  time.      (79) 

We  have  previously  noted  in  Section  4.1  the  correspondence  of  these  results 

to  those  for  "inertialless"  combat.   We  now  consider  the  case  when 

(J>(t=T)  =  0.   Let   t    denote  the  (forward)  time  when  the  system  enters 

e 

a  constrained  subarc  with  <f>  =  0 ;   similarly   t1   denotes  the  time  of 
leaving  one  with  <J>  =  1.   We  further  assume  that 

<|>(t)  =0  for   t°  a:  t  as  T.  (80) 

We  now  prove  that  it  is  impossible  to  have  v(t=t  )  =  0;   in  fact, 

v(t)   must  be   <  0  before  <f>  =  0 .   We  begin  by  observing  that   4>(t=t  )  =  0 

with  0  <  <b(t)  <  1   for   t°  -  6  <  t  <  t°  where   6  >  0.   Hence,   u*(t)  =  -R. 

e  e  L 

and  p.  (t)  SO  by  (17)  for   t°  -  6.  <  t  <  t°  where  0  <  6,  s;  6 .   Con- 
4  e    1        e  1 

sidering  a  Taylor  series  expansion  about   t  =  t    and  (27)  ,  (65)  ,  and  (66) , 

we  have  for   t  £  t 

e 

(t°-t)2  , 

p4(t)  =  (t°-t)y(t°)v(t°)  +  — ^ {(b1x1+b2x2)v(t')  -  y(~)  ^  (t)},     (81) 

where  ~  €  (t,t°).   Considering  that  ~  <  t  ,   (64),  and  (81),  it  is 
easily  seen  that  v(t  )  k  0  =>   p,  (t)  >  0   for   t  <  t  ,   which  is  impossible 


by  the  above.   Hence, 


v(t=t°)  <  0.  (82) 

e 
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Nov,  it  is  readily  shown  that  for   tj  <;  t  £  t  ,   P-(t)  =   y(s)v(s)ds 


so  that  recalling  (30) 


P4(tJ+)  = 


(83) 


Then,  the  continuity  of  v(t) ,   (64)  and  (82)  yield  that  v(tj)  >  0.   Denote 

the  time  at  which  v  =  0  as  T  -  t   so  that  v(t=T-x  )  =  0.   Then  since 

rt 


we  also  have  p, (t)  = 


y(s)v(s)ds,   it  follows  that   P,(t)  <  0  for 


tj  <  t  <  t°   and  hence  by  (17) 


ii*(t)  =  -IL   for   t*  <  t  < 


It  follows  that   t   -  tj  =  1/R_  .   The  times  t\      and   t   are  determined 

e    %  X  I  e 

by  the  conditions 


T  -  t1  €  (c£»t°)   where  v(t=T-T  )  -  0, 


(84) 


and 


t° 

re 


y(s)v(s)ds  =  0, 


(85) 


which  may  be  written  as 

T-x. 


y(t)v(t)dt  =  - 


t° 
re 


T-x. 


y(t)v(t)dt. 


(86) 


w£  *  '1 

The  relationship  of  the  times   tj  ,   T-x,  ,   and   t   to  the  time  histories 

x.      1         e 

of  <|)(t)   and  v(t)   are  shown  in  Figure  1. 
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<fr-l 

k 

t=T-T 

v    J 
v=0 

i 

t-t° 

e 

t=T             time,    t 

i ^ 

time,    t 

t=0 


Figure  1.   Relationship  of   tj ,   T-T- ,   and   t 
to  Values  of   <j>   and   v. 


Omitting  further  details,  we  reach  an  important  conclusion:   for  the 
"inertial"  combat  (2)  one  begins  to  redistribute  fire  earlier  in  forward 
time  (anticipating  changes  in  target  priority)  than  in  the  corresponding 
"inertialless"  case  (1) .   Again,  the  reader  is  referred  to  Figure  1  for 
motivation  of  this  statement. 

The  above  considerations  on  tracing  extremals  backwards  from  the 
various  terminal  states  of  battle  are  summarized  in  Table  I.   This  table 
shows  from  which  of  the  end  states  extremals  lead.   An  entry  like  "possible 
(but  not  too  likely)"  or  "possible  (but  unlikely)"  means  that  the  domain 
of  controllability  for  extremals   to  that  end  state  is  a  "rather  small" 


*By  the  domain  of  controllability  for  extremals  to  an  end  state  we  mean  the 
set  of  points  in  the  initial  state  space  from  which  extremals  lead  to  the 
terminal  state. 
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Table  I.   Possible  End  States  of  Battle  for 


Extremals  when  x..  (T)  ,x„  (T)  >   0. 


axp  ;>  a£q 


a1P  <  a2q 


4>(T)  =  0 

(on  constrained  subarc 

for  finite  time  interval) 


<KT)  =  0 
(but  4>(t)>0   for  T-<S<t<T) 


0  <  <t>(T)  <  1 


<J>(T)  =1 

(on  constrained  subarc 

for  finite  time  interval) 


<KT)  =  1 
(but  <J»(t)<l   for  T-6<t<T) 


impossible 

possible 
(and  quite  likely) 

impossible 

possible 
(but  unlikely) 

possible 
(but  not  too  likely) 

possible 
(but  not  too  likely) 

possible 
(and  quite  likely) 

impossible 

possible 
(but  unlikely) 

impossible 
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subset  of  the  initial  state  space  in  contrast  to  an  entry  like  "possible 
(and  quite  likely)"  for  which  the  corresponding  set  of  initial  conditions 
is  "much  larger."   It  has  not  been  possible  to  develop  explicit  formulas 
for  these  domains  of  controllability  (as  done,  for  example,  in  [14],  [19]). 

6.   Synthesis  of  Extremal  Policy  when  x  (T)  =  0   and   x?(T)  >  0. 
There  are  two  cases  to  consider: 

Case  (a)   on  constrained  subarc  for  finite  interval  of  time, 

Case  (b)   x  (t)  >  0   for   t  <  T. 

For  Case  (a)  :   x..(t)  =  0   for   t  £  t  £  T  with   t   <  T,   it  is 
clear  that  we  must  have  4>(t)  =  0   for   t  £  t  £   T   so  that  integration 
of  the  adjoint  equations  (13)  through  (15)  and  (63)  with  the  boundary 
conditions  (16)  yields  that  for   t   £  t  :£  T  we  have 


p2(t)  =  -q  cosh/a^  (T-t),  (87) 


P3(t)  =  q  /^  sinh  /a2b2  (T-t),  (88) 

v(t)  =  v(T)  +  t^  (a.b  -a.bJ{cosh  /a.b.  (T-t)  -  1}   with  v(T)  £  0,    (89) 

the  requirement  that   v(T)  ^  0   being  a  consequence  of  (43) .   We  also  have 

by  (40)  and  (62)  that 

T 

u(t)  m   y(T)  +—   v(s)ds   with   u(T)  ^  0.  (90) 

al  J 

t 

Recalling  our  nonrestrictive  assumption  that  ±->]i->    >   £2— ?   anc^  (39)  , 
consideration  of  (89)  yields  that 


28 


p  (t)  >  0   for   t.  £  t  <  T  with  p  (t=T)  =  0, 
and  v(t)  >  0   for   t-  si  t  <  T  with  v(t=T)  ^   0. 

Hence,  we  see  that  the  necessary  conditions  of  optimality  (42)  through 
(44)  are  always  satisfied  on  the  constrained  subarc.   Thus,  there  are  no 
restrictions  on  when  such  an  extremal  can  occur. 

We  next  show  that  we  must  have  v(t..)  <  0.   The  proof  is  by  con- 
tradiction.  First,  we  observe  that  x  (t)  >  0  for   t  <  t..   and  x  (t=t1)  =  0 
imply  that 

u*(t)  =   -\      for   t   -  6  <  t  <  t   where  6  >  0.         (91) 

If  we  had  v(t")  £  0,   then  (63)  and  the  fact  that  p~(t)  >  0  for   t  <  T 
(which  follows  from  (88))  yield  that  v(t)  >  0   for   t  <  t  .   This  yields 
p,(t)  >  0   for   t  <  t1   by  (65)  and  the  condition  (51)  that  PaCO  ■  0. 
However,  (91)  is  impossible  if   P/(0  >   0. 

It  should  be  noted  that  v(t  )  <  0  and  v(t  )  >  0  guarantee  that 
vQ  >  0,    since  from  (47),  (48),  and  (62)  it  follows  that 

U   a..     i       1 

It  should  be  recalled  that  v_   is  chosen  so  that  $   =  0  when  x.(t')  =  0. 
The  value  of  vn  depends  on  x1  ,   x»,   yn ,   <j>n,   and  T,  .   The  multiplier 
v1   is  unrestricted  and  is  chosen  so  that  v(T)  =  (a.p-a  q)  -  a  v  ^0. 

Considering  the  corner  conditions  (47) ,  (48) ,  and  (51) ,  we  have  shown 
that 


* 
This  condition  was  shown  to  be  necessary  for  optimality  in  Section  4.2. 
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P1(t1)  <  0,    P2(tL)  <  0,   p3(tl)  >  0,   P4(t1)  =  0,   v(tx)  <  0.    (92) 

Further  details  for  0  ^  t  <  t   in  the  backwards  synthesis  are  similar 
to  those  given  for  Case  (b)  in  Section  5  and  are  therefore  omitted.   Some 
possibilities  for  the  synthesized  extremal  control  are  shown  in  Table  II. 
Other  variations  in  the  form  of  the  synthesized  extremal  control  are 
possible,  and  the  reader  should  have  no  difficulty  in  identifying  them. 

For  Case  (b)  :   x..  (t)  >  0   for   t  <  T ,   there  are  three  subcases  to 
be  considered: 

Subcase  (bl)   <fr(t=T)  =  0, 
Subcase  (b2)   0  <  <J>(t=T)  <  1, 
Subcase  (b3)   (j)(t=T)  =  1. 

Further  analysis  yields  the  results  shown  in  Table  III.   This  shows  from 
which  of  the  end  states  extremals  lead.   We  will  now  sketch  how  these 
results  were  obtained. 

Subcase  (bl) :   <fr(t=T)  =  0. 

Since  <j>(t=T)  =  0,   (16)  yields  that  p.  (x=0)  =  v,  ^  0.   It  is  clear 
that  we  must  have   <f>(t)  >  0   for   t  €  (T-6,T)   for  some   6  >  0.   Hence, 
by  (17)  and  (20) 

u*(t)  =  -^   for   t  €  (T-61,T)  C  (T-6,T).  (93) 

We  now  prove  that   p , (t=0)  =  0.   The  proof  is  by  contradiction.   If 

p, (t=0)  >  0,   then  p  (t)  >  0   for   t  €  (T-6„,T)   and  this  contradicts  (93) 


To  establish  this  result  one  makes  the  identification  p,  (t)  =  .  ,    s  , 
where  W  denotes  Bellman's  optimal  value  function.   It  follows   1 
that   p,(t)  <  0,   since  addition  of  more  X   at   t   cannot  help  but  reduce 
Y's   optimal  return.   A  justification  of  this  argument  is  given  in  [18]. 
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Table  II.   Some  Possibilities  for  Synthesized  Extremal  Controls  and 
Collateral  Information  for  Case  in  which  x^ (T)  =  0   (On 
Constrained  Subarc  for  Finite  Time  Interval;  and  x-(T)  >  0 
(Must  Have  v(T)  ;>  0)  . 


Case   (1) . 

time,    t 

?+W 

v(t) 

u*(t) 

♦w 

x1(t) 

t=0 

>0 

>0 

*u 

0S*<1 

>0 

0<t<tx 

e 

>0 

>0 

*u 

0<Q><1 

>0 

t-t1" 

e 

0 

>0 

«o 

1 

>0 

t=t1+ 

e 

>0 

>0 

0 

1 

>0 

tx<t<tj 

>0 

>0 

0 

1 

>0 

e         I 

1+ 

0 

>0 

0 

1 

>0 

0 

>0 

-\ 

1 

>0 

ti<t<T-rx 

<0 

>0 

-\ 

0«J><1 

>0 

t»T-T 

<0 

0 

-\ 

0<<{><1 

>0 

T-T    <t<t1 

<0 

<0 

-\ 

0«fr<l 

>0 

t=ti 

0 

<0 

-*L 

0 

0 

t-4 

>0 

>0 

0 

0 

0 

t  <t<T 

>0 

>0 

0 

0 

0 

t«T 

0 

fcG 

0 

0 

0 

Case    (2). 

time,    t 

^(t) 

v(t) 

u*(t) 

too 

xx(t) 

t=0 

>0 

>0 

*u 

0s-j><l 

>0 

0<t<t 

s 

>0 

>0 

*u 

0<<j><l 

>0 

t=t 

t=t 
s 

0 

>0 

h 

0<<j)Sl 

>0 

0 

>0 

-*L 

0<(J)Sl 

>0 

ts<t<T-x1 

<0 

>0 

-*L 

0<<J><1 

>0 

t=T-Tl 

<0 

0 

-h 

0«jxl 

>0 

T-T.<t<t1 

<0 

<0 

-\ 

0<<J><1 

>0 

*-h 

0 

<0 

-\ 

0 

0 

t-4 

>0 

>0 

0 

0 

0 

t   <t<T 

>0 

>0 

0 

0 

0 

t=T 

0 

feO 

0 

0 

0 

Case   (3). 

time,    t 

£4(t) 

v(t) 

u*(t) 

<Kt) 

xx(t) 

t=0 

SO 

>0 

-\ 

0<(j)SLL 

>0 

0<t<T-Tl 

<0 

>0 

-\ 

0<<j><l 

>0 

t=T-Tl 

<0 

0 

-\ 

0<<J><1 

>0 

T"T1<t<t1 

<0 

<0 

-*L 

0<4><1 

>0 

t-t£ 

0 

<0 

"*L 

0 

0 

«t 

>0 

>0 

0 

0 

0 

t   <t<T 

>0 

>0 

0 

0 

0 

t-T 

0 

iO 

0 

0 

0 

31 


Table  III.   Possible  End  States  of  Battle  for 

Extremals  when  x  (T)  =  0   (But   x  (t)  >  0 
for   t  <  T)   and   x  (T)  >  0. 


alP  ^  a2q 


a]_P  <  a2q 


<f>(T)  =  0 

(on  constrained  subarc 

for  finite  time  interval) 


+  (T)  =0 
(but  4>(t)>0   for  T-6<t<T) 


0  <  $(T)  <  1 


<KT)  =  1 

(on  constrained  subarc 

for  finite  time  interval) 


<KT)  *»  1 

(but   4>(t)<l   for   T-6<t<T) 


impossible 

impossible 

possible 
(but  very  unlikely) 

possible 
(but  very  unlikely) 

possible 
(but  not  too  likely) 

possible 
(but  not  too  likely) 

possible 
(and  quite  likely) 

impossible 

possible 
(but  unlikely) 

impossible 
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by  (17).   Furthermore,  we  must  have  v(T)  <  0.   Again,  the  proof  is  by 

3W 
contradiction.   First,  we  remark  that  by  considering  P-,(t)  ■  - — 7—7  (see 

[18]  for  further  discussion)  we  obtain  that  p~(t)  >  0   for   t  <  T.   Thus, 

by  (63)  we  have  —  (t)  >  0   for  t  >  0.   If  we  had  v(T)  :>  0,   then  we 
dT 

would  have  v(t)  >  0   for   t  <  T,   and  hence  p,(t)  >  0   for   t  €  (T-52,T) 
via  (65)  and  p,(T)  ■  0.   However,  by  (17)  this  contradicts  (93). 

By  (16)  we  see  that  v(T)  <  0  =* —  (a.  p-a„q)  <  v,  £  p,   the  latter 

a    1    I  I 

inequality  a  consequence  of  requiring  p,  (T)  £  0.   It  is  clear  that  v.. 
can  always  be  chosen  to  satisfy  the  above  conditions  regardless  of  whether 

a.p  S:  a?q  or  a.,  p  <  a„q.   Thus,  this  subcase  is  always  possible  for  the 
appropriate  initial  values  of  the  state  variables.   Furthermore,  we  have 

Pl(T)  £  0,   p2(T)  <  0,   p3(T)  *=  0,   p4(T)  =  0,   v(T)  <  0, 

so  that  the  synthesized  extremal  control  function  can  take  any  of  the  forms 
shown  in  Table  III  for  0  £  tf  £  1. 1      the  realization  of  any  particular 
form  being  dependent  upon  the  state  variable  initial  conditions. 

Subcase  (b2)  :   0  <  <j»(t=T)  <  1. 

Since  0  <  <f>(t=T)  <  1,   (16)  yields  that   p  (t=0)  -  0.   Moreover, 
for  0  <  <$>(T)  ^  1   the  transversality  condition  H(T)  =  0  no  longer  holds, 
since  when  xn (T)  =  0   (=y(T)),   variations  in  control   6u  cannot  increase 
T  because  this  would  lead  to  violation  of  the  constraint  x  ^  0   if  the 
planning  horizon  were  extended  to  T  +  dT  with  x  (T)  =  0  and  cj>(T)  >  0. 
Then,  a  one-sided  version  of  the  usual  variational  argument  [2]  yields 
(after  dropping  some  terms)   dJ  =  H(T)dT  £  0  with  dT  2:  0 ,   where  J 
denotes  the  (augmented)  return  functional.   This  yields  H(T)  ^  0  and 
consequently  p-(T)  ^  0.   Again  using  the  argument  which  considers 
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3W 
P3(T)  =  a  (T)   [18],   it  follows  that   p  (T)  =  0   (and  also   p  (t)  >  0 

for   t  <  T)  . 

Now,   v(t=0)  =  (a  p-a^q)  -  a  v   where  0  £  v  £  p,   the  latter 
inequality  being  a  consequence  of  requiring   p,  (T)  :£  0.   For   a  p  k  a„q, 
we  can  have  v(x=0)   either  ^  0  or  <  0.   When  v(t=0)  ^  0,   the  resulting 
synthesized  extremal  control  takes  a  form  similar  to  that  shown  in  Table  IV 
below.   When  v(t=0)  <  0,    it  takes  any  of  the  forms  shown  in  Table  II 
for  0  £  t  £  t- .   In  all  cases  v   is  chosen  so  that  x  (T)  =  0.   The 
realization  of  the  synthesized  extremal  control  depends  upon  the  state 
variable  initial  conditions. 

Subcase  (b3)  :   <f>(t=T)  =  1. 

Since  <Kt=T)  =  1,   (16)  now  yields   p,(t=0)  =  -v  £  0.   If  <|>(t)  <  1 
for  T  -  6  £  t  <  T  where  6  >  0,   then  previous  arguments  (i.e.  proof  by 
contradiction)  yield  that  p, (t=0)  =  0.   Next,  we  show  that  we  must  have 
v(t=T)  ^  0.   The  proof  is  by  contradiction.   First,  we  observe  that 
4>(t)  <  1   for  T  -  6  £  t  <  T  implies  that 


u*(t)  =  Ry   for   t  €  (T-6rT)  C  (T-<S,T) 


(94) 


If   v(T)  <  0,   then  v(t)  <  0   for   t  €  (T-62,T).   Considering  a  Taylor 
series  expansion  about   t  =  T,   we  obtain  p, (t)  =  (T-t)y  (t)  v(t)   where 
t  €  (t,T),   since   p4(T)  =  0.   But  then   t  €  (T-52>T)  H  (T-6;L,T)   yields 
p,(t)  <  0  and  this  contradicts  (94)  by  (17).   Since  v(T)  =  (a^-a^)  - 
a  v  ^  0  where  \>     ^  0,   it  follows  that  this  subcase  with  <j>(t)  <    1 
for   t  <  T   is  only  possible  when  a  p  ^  a  q. 

If  we  are  on  a  constrained  subarc  with  <J>(t)  =  1   for   te  ^  c  ^  ^' 
then  (23)  and  (25)  must  hold.   The  former  yields   p  (t=0)  =  0,   while  the 


This  is  the  only  case  possible  when  a  p  <  a  q. 
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latter  yields  0  £  v  £ —  (a^p-a^)   so  that  this  subcase  with  <{>(t)  =  1 
for   t1  £  t  £  T   is  only  possible  when  a  p  ^  a„q.   The  further  synthesis 
of  the  extremal  control  now  follows  previous  arguments.   The  synthesized 
extremal  control  is  shown  in  Table  IV. 


Table  IV.   Synthesized  Extremal  Control  and  Collateral  Information 

for  Case  in  which  x. (T)  -  0   (But  x  (t)  >  0   for   t  <  T) 

and  x_(T)  >  0  with  <j>(t)  =  1  for  t1  £.   t  <.   T. 
z                          e 


time,  t 

£4!^ 

v(t) 

u*(t) 

<f>(t) 

*il$2 

t=0 

>0 

>0 

\ 

0^4)<1 

>0 

0<t<tx 
e 

>0 

>0 

\ 

0«j)<l 

>0 

t-t1" 

0 

>0 

0 

1 

>0 

e 

>0 

>0 

0 

1 

>0 

t*<t<T 
e 

>0 

>0 

0 

1 

>0 

t=T 

0 

^0 

0 

1 

0 

Note:   This  case  is  only  possible  when  a.,  p  ^  a„q. 


7 .   Synthesis  of  Extremal  Policy  when  x  (T)  >  0  and  x^CT)  =  0. 

When  x  (T)  =0,   we  must  have  x  (t)  >  0   for   t  <  T,   since  it 
is  nonoptimal  (see  Section  4.2  above)  to  be  on  a  constrained  subarc  with 
x„  =  0   for  a  finite  interval  of  time.   There  are  then  three  cases  to  be 
considered: 

Case  (a)   $  (t-T)  =  0, 

Case  (b)   0  <  <f>(t=T)  <  1, 

Case  (c)   <j>(t=T)  =  1. 
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Further  analysis  now  yields  the  results  shown  in  Table  V.   This  shows 
from  which  of  the  end  states  extremals  lead.   The  symmetry  of  these  results 
(interchange  x   and  x  )   in  relation  to  those  shown  in  Table  III  for 
x.(T)  =  0  but   x  (t)  >  0   for   t  <  T   should  be  noted.   We  will  now  sketch 
how  these  results  were  obtained. 


For  Case  (a):   <b(t=T)  =  0,   (16)  yields  that  p.  (x=0)  =  v.  ^  0. 

-c ^4  4 

Observing  that  <j>(t)  >  0   for   t  €  (T-6.T)   again  implies  (93),  it  follows 

dp4 
by  previous  arguments  that  p,(x=0)  =  0.   Also,  (93)  implies  that  -: —  (t)  <  0 

for  t  €  (0,62)  C  (0,61),   and  hence  v(t=0)  <  0  by  (65).   Observing  that 

v(t=0)  =  (a  p-a  q)  +  a  v„,   it  follows  that  this  case  with  <}>(t)  >  0   for 

t  <  T   is  only  possible  for   a  p  <  a  q.   The  synthesized  extremal  control 

takes  the  form  shown  in  Table  II  for  0  £  t  £  t  . 

If  we  are  on  a  constrained  subarc  with  <£(t)  =  0   for   t  £  t  £  T, 

then  (33)  and  (34)  must  hold,  whence  it  follows  that  p  (x=0)  =  0  and 

v(x=0)  =  (a.p-a.q)  +  a_v_  £  0   or   0  <;  v0  £  —  (a  q-a  p)  .   Then   p  (t=0)  = 

-q  +  v   <  0   so  that  we  must  have  v(T)  <  0,   since  for   t  £  t  £  T  we 

2  (q-vJ        t 

have  v(t)  =  v(T)  +  (a  b^a^)  -g — —  (cosh  /a^  (T-t)  -  1}   and  v(t)  ^  0 

by  (34)  and  (62).   Thus,   0  <:  v  <  —  (a  q-a  p)   so  that  this  case  with 

L       a_   z   J. 

<|)(t)  =0   for   t°  ^  t  ^  T   is  only  possible  when  a  p  <  a2q.   The  synthesized 
extremal  control  is  shown  in  Table  VI. 

For  Case  (b)  :   0  <  4>(t=T)  <  1,   (16)  yields  that   ?4(t=0)  =  0.   If 

a  p  s>  a  q,   then  v(T)  k  0.   The  requirement  that   p2(T)  s:  0  yields 

dv  /  N 
0  £  v „  aS  q  so  that  by  (14)  we  have  p_(t)  >  0   for   t  <  T,   whence  —  (x) 

>  0   for  all   x  >  0.   Hence,   v(x)  >  0   for   x  >  0,   and  the  synthesized 
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Table  V.   Possible  End  States  of  Battle  for 

Extremals  when  x.(T)  >  0  and  x  (T)  =  0 
(But  x2(t)  >  0  for   t  <  T) . 


a1p  :>  a2q 


axP  <  a2q 


<KT)  =0 

(on  constrained  subarc 

for  finite  time  interval) 


♦  (T)    =  0 
(but     <f>(t)>0      for     T-<5<t<T) 


0  <   <j»(T)    <   1 


♦  (T)    «  1 

(on  constrained  subarc 

for  finite  time  interval) 


4>(T)    =   1 

(but     <Kt)<l     for     T-<S<t<T) 


impossible 

possible 
(and  quite  likely) 

impossible 

possible 
(but  unlikely) 

possible 
(but  not  too  likely) 

possible 
(but  not  too  likely) 

impossible 

impossible 

possible 
(but  very  unlikely) 

possible 
(but  very  unlikely) 
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Table  VI.   One  Possibility  for  Synthesized  Extremal  Control  and 

Collateral  Information  for  Case  in  which  x  (T)  >  0 

and  x  (T)  ■  0   (but  x  (t)  >  0  for   t  <  T)   With 

<J>(t)  =0   for   t°  £  t  £  T. 
e 


time,    t 

R^m 

v(t) 

u*(t) 

<r(t) 

x2(t) 

t*0 

>0 

>0 

*u 

Osajxl 

>0 

0<t<t: 
e 

>0 

>0 

*u 

0<<J><1 

>0 

t=t1_ 
e 

0 

>0 

h 

1 

>0 

e 

>0 

>0 

0 

1 

>0 

t1<t<t01 
e          I 

>0 

>0 

0 

1 

>0 

«i" 

0 

>0 

0 

1 

>0 

-f 

0 

>0 

-*L 

1 

>0 

tJ<t<T-T. 

l            1 

<0 

>0 

-\ 

0<4><1 

>0 

t-T-T. 

<0 

0 

-\ 

0«fr<l 

>0 

T-T     <t<t 

1          e 

<0 

<0 

-h 

0<4><1 

>0 

o- 

t=t 

e 

0 

<0 

-\ 

0 

>0 

e 

<0 

<0 

0 

0 

>0 

t   <t<T 
e 

<0 

<0 

0 

0 

>0 

t=T 

0 

<0 

0 

0 

0 

Notes:   (1)   This  case  is  only  possible  when  a  p  <  a_q. 

(2)   Variations  In  the  synthesized  extremal  control  analogous 
to  those  shown  in  Table  II  are  possible. 
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control  is  as  shown  in  Table  IV  for  0  £  t  £  t ' .   If  a  p  <  a  q,   then  we 
can  have  either  v(T)  k  0  or  v(T)  <  0.   The  former  case  has  just  been 
discussed,  while  the  latter  case  yields  a  synthesized  extremal  control  like 
one  of  those  shown  In  Table  II  for  0  £  t  £  t„  . 


For  Case  (c)  :  <j>(t=T)  =  1,   (16)  yields  that  p  (t=0)  =  -v  £  0. 
It  is  clear  that  we  must  have  <j)(t)  <  1  for   t  €  (T-6,T),   since  x  (t)  >  0 
for   t  <  T  and  x_(T)  =  0.   Previous  arguments  readily  yield  p, (x=0)  =  0 
and  0  a:  V-  S  q   (since  P2(T)  £0).   If  a  p  ^  a  q,   then  v(T)  2:  0  and 
further  results  are  similar  to  those  of  the  previous  case.   If   a  p  <  a  q, 
then  v(T)   is  either  ^  0  or  <  0.   The  case  in  which  v(T)  £  0  has 
just  been  discussed.   If  v(T)  <  0,   this  leads  to   p, (t)  <  0   for 
t  €  (T-<S_,T).   By  (17)  this  is  impossible,  however,  since  we  must  have  (94) 
hold,  since  <J>(t)  <    1   for   t  6  (T-<S,T).   Thus,  this  case  is  always  possible 
for  the  appropriate  initial  values  of  the  state  variables,  although  we  must 
always  have  v(T)  £  0.   The  synthesized  extremal  control  is  the  same  as  for 
Case  (b)  with  v(T)  ^  0. 


8.   Determination  of  the  Optimal  Fire  Distribution  Policy. 

It  remains  to  discuss  how  the  optimal  fire  distribution  policy  may 
be  determined  from  among  the  extremal  control  policies  developed  in  the 
previous  sections.   Two  ways  of  proving  the  optimality  of  an  extremal 
trajectory  are  as  follows  (see  [17],  [18]): 

(a)  show  that  sufficient  conditions  of  optimality  are  satisfied  on 
the  extremal, 

(b)  by  citing  the  appropriate  existence  theorem,  show  that  an  optimal 
control  exists  for  the  problem  at  hand;  there  are  two  further 
subcases:   (1)   if  the  extremal  Is  unique,  then  it  is  optimal  or 
(2)   if  the  extremal  is  not  unique  and  only  a  finite  number  exist, 
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then  the  optimal  trajectory  is  determined  by  considering  the 
finite  number  of  alternatives. 

In  the  first  case  there  are  both  local  and  global  sufficiency  theorems 

to  be  considered.   Neither,  however,  is  convenient  to  apply  to  the  problem 

(2)  at  hand.   The  usual  control  theory  sufficient  conditions  for  a  local 

maximum  (see  pp.  181-184  of  [2])  are  not  satisfied,  since  the  problem  (2) 

is  singular  (in  the  sense  that  -r— ?  =   0)  .    Sufficient  conditions  for  a 

du 

global  maximum  depend  upon  the  appropriate  functions  being  concave  and 
the  planning  horizon  being  fixed  in  length  [4],  [9].   The  latter  condition 
is  not  satisfied  for  (2),  since,  for  example,  the  battle  can  end  by  Y 
being  annihilated  at  any  time  before  T  . 

Thus,  as  in  earlier  papers  [14],  [18],  [19],  we  are  again  led  back 
to  the  enumeration  of  all  extremals  in  various  regions  of  the  state  space 
(i.e.  the  intersections  of  the  domains  of  controllability  for  extremals 
leading  to  the  various  terminal  states  of  battle  (see  [14],  [19]))  and 
comparison  of  corresponding  returns.   The  author  has  not  been  able  to  develop 
analytic,  closed-form  results  for  the  integration  of  the  state  equations  (2) 
in  the  general  case  (much  less  for  the  determination  of  domains  of  con- 
trollability or  computation  of  extremal  returns).   Thus,  it  has  not  been 

possible  to  analytically  determine  the  optimal  control  from  among  the  candi- 

** 
date  extremal  controls  for  the  problem  at  hand   as  was  done  in  [14]  (see 

also  [19]). 

Moreover,  the  existence  of  an  optimal  control  readily  follows  from 

the  control  variable  u  being  uniformly  bounded  (see  Corollary  2  on  p.  262 


* 

Recently,  conditions  sufficient  for  a  local  maximum  have  been  developed  for 

the  singular  control  problem.   These  are,  however,  essentially  impossible 

to  check  for  the  problem  at  hand  (see  [18]). 
** 

For  given  initial  and  parameter  values  this  may  be  done  numerically  by 

following  the  steps  outlined  in  [14]  (or  [15]). 
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* 
of  [8]).   The  uniform  boundedn.es s  of  responses  is  a  consequence  of  u 

being  required  to  lie  in  a  compact  set  (i.e.   -»  <  -EL  £  u  £  IL  <  +». 
Let  us  observe  that  for  -R_  =  -°°  and  R_  =  -H»  problem  (2)  is  equivalent 
to  problem  (1) ,  the  "inertialless"  combat  problem.   In  this  case  there 
will  be  jumps  in  state  variable  <j>   in  problem  (2)  .   Additional  consid- 
erations are  now  required  in  the  development  of  necessary  conditions  of 
optimality  (see  [12],  [23],  [24]).   It  should  be  pointed  out,  moreover, 
that  the  existence  theorems  of  Lee  and  Markus  [8]  (and  others)  only  apply 
for  admissible  trajectories  which  are  absolutely  continuous.   Hence,  they 
no  longer  can  be  Invoked  to  insure  the  existence  of  an  optimal  control  when 
-IL  =  -oo  and  FL  =  +». 

9.   More  General  Redistribution  of  Fire  Model. 

In  the  model  (2)  considered  above  the  rate  of  redistribution  of 
fire  by  Y   firers  was  assumed  to  be  bounded,  i.e.   -R.  <.  -j-~  ^  R^  ,   and 
not  to  be  dependent  upon  the  distribution  of  fire.   It  is  of  considerable 
interest  and  importance  for  us  to  continue  our  investigation  of  the 
dependence  of  the  structure  of  the  optimal  fire  distribution  policy  upon 
model  form  (see  [16]).   The  above  simple  model  (2)  for  redistribution  of 
fire  is  equivalent  to 

where  (a)   F  (u=0)  =  0  and  (b)   F  (u)   is  a  concave  function  for  u  €  [-R,  ,RjT] 
with  F'(u=R_)  >  0. 

Let  us  consider  the  more  general  case  in  which  the  rate  of  redistri- 
bution of  Y-fire  is  bounded  and  also  dependent  upon  the  distribution  of 
fire.   Thus,  we  consider  the  model 

See  also  [17],  [18]. 
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j£  =  F(*»^)>  (96) 

where  F(4>,u)   reflects  the  ability  of  Y   firers  to  redistribute  their 
fire  over  X-target  types.   We  will  see  below  that  the  functional  form  of 
F(<j>,u)  (as  long  as  it  is  not  pathological)  does  not  affect  the  structure 
of  the  optimal  fire  distribution  policy,  although  the  time  history  of  the 
distribution  of  Y-fire,  i.e.   <j>(t)   for   0  £  t  £  T,   does  vary,  of  course, 
with  changes  in  F(4>,u). 

What  are  appropriate  properties  to  postulate  for  F(<f>,u)   in  order 
to  model  the  real  world?   First  of  all,  it  must  be  possible  to  keep  the 
distribution  of  fire  constant.   Thus,  we  stipulate  that  if   u  =  0,   there 
is  no  redistribution  of  fire.   Also,  if   u  >  0   (<0)  ,   then  <J>   increases 
(decreases)  (but  the  rate  of  change  is  bounded) .   It  seems  appropriate  to 
postulate  that  there  are  "stragglers"  in  redistributing  their  fire,  as 
$   approaches  zero  (or  one)  its  rate  of  change  decreases.   Finally,  we 
assume  that  all  the  Y   firers  can  shift  their  fire  in  a  finite  interval 
of  time. 

We  therefore  make  the  following  assumptions  about   F(<j>,u): 

(Al)  for  all  <|>  £    [0,1] 

!-F   <;  F(<J>,u)  <  0   for   -£L  £  u  <  0, 
F(<J>,u)  =0   for   u  =  0, 
0  <  F(<J>,u)  <:  F  for  0  <  u  <;  R^ 

where  F  ,F   >  0, 

(A2)   F(<j>,u)   is  piecewise  C     in  its  arguments  for  all 


4>  6  [0,1]   and  u  6  ["^.^1 
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(A3)   for  fixed  <j>  6  [0,1],   F(4>,u)   is  concave  in  u  for 
-\*»*h     WUh  |^(*.u-V  >0, 


(A4)   F(<j>,u)   is  a  strictly  /^ncreasin§)   function  of 

{ decreasing j 

fixed  u  such  that  <  ~L     *    >  . 


<J»   for 


An  example  of  such  a  function  is 


F(4>,u)  = 


u(e  +<j>)    for  -IL  £  u  <  0, 


u(l+e_-<j>)   for  0  £  u  <:  R_T, 
i  U 

where  e  ,e  >  0. 


(97) 


Then  we  have,  for  example,   <J>(t)  m   -£-,  +  (e..+<|>n)e 
-R_   for   tfl  £  s  £  t. 


•RL(t"C0) 


when  u(s)  = 


We  now  show  that  for  problem  (2)  when  (96)  is  used  with  F(<J>,u) 

satisfying  (Al)  through  (A4) ,  the  structure  of  the  optimal  fire  distribu- 

*  d<b 

tion  policy  is  the  same  as  for  problem  (2)  (with  ■—£■  =■  u)  .   The  Hamiltonian 

in  this  case  is  given  by 


H(t,x1,p1,u)  =  -p1(()a1y  -  p2(l-<fr)a2y  -  p3(b1x1+b2x2)  +  p4F(<j>,u) 


-  nx(t)u  +  n2(t)u  -  u(t){F(<j),u)a1y  -  ^  (b^+b^)  }  ,   (98) 


so  that  (15)  is  now  replaced  by 


dp 


4  3F        ( dF  \ 

r =  piV  "  p2a2y  -  p4  a? +  y(t)W  aiy  *  ai(bixi+b2x2);  •     (99) 


Of  course,  <p(t)      and  dependent  quantities  differ  in  their  particular  form. 
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When  x   ,x   ,y   >  0  and  0  <  <J>  <  1,   the  extremal  control  is  again 

given  by  (17)  because  of  assumption  (A3).  Again,  it  will  be  shown  that 

3F 
there  is  no  singular  solution.   This  is  proved  as  follows.   Since  ^r"  t   0 , 

3u 
3H 
the  condition  —  =  0   for  a  finite  interval  of  time  again  leads  to  (18) 

and   d^lhp  =  °   °n  the  sin8ular  subarc  so  that  (19)  again  follows.   For 

an  arc  on  which  (18)  and  (19)  hold  we  again  have  (20)  so  that  -7^7- (4^)  t   0, 

dt*-  ^du-' 

since  it  has  been  shown  that   p»(t)  >  0   for   t  <  T,   and  we  have  assumed 


no 


a.b   >  a  b  .   It  follows  then  that  there  is  no  singular  solution  (i.e 

singular  subarc  in  the  solution) . 

The  analysis  of  constrained  subarcs  also  follows  that  given  above 

for  problem  (2) .   We  illustrate  this  for  a  constrained  subarc  on  which 

x.  (t)  -  0   for   t  s  t  as  T.   Again,  (36)  and  (37)  hold.   Now 
1  e 

3H    3F  ,       . 
lu"  =  an-  (P4"Maly)  ' 

r)H  3F 

so  that  —  =  0   again  yields  (38)  ,  since  —  ^0.   It  is  readily  shown 

dU  oil 

in  a  similar  fashion  that  (40)  and  (41)  still  hold  so  that  the  necessary 
conditions  of  optimality  on  a  constrained  subarc  with  x1 (t)  =  0   for  a 
finite  interval  of  time  are  again  given  by  (42)  through  (44) .   Treatment 
of  other  types  of  constrained  subarcs  is  similar  and  further  discussion 
is  omitted. 

Thus,  we  have  shown  that  the  characterization  of  an  optimal  fire 
distribution  policy  for  the  more  general  redistribution  of  fire  model 
given  above  is  exactly  the  same  as  that  for  problem  (2) . 

10.   Discussion. 

In  this  section  we  discuss  what  we  have  learned  about  the  influence 
of  command  and  control  limitations  on  the  structure  of  optimal  fire 
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distribution  policies.   The  reader  should  bear  in  mind,  moreover,  that 
the  combat  model  considered  in  this  paper  is, far  too  simple  to  have  the 
results  obtained  from  it  be  taken  literally,  but  he  should  interpret  these 
results  as  indicating  general  principles  or  hypotheses  to  be  further 
investigated  by  higher  resolution  analysis  techniques  such  as  field  experi- 
mentation or  computer  simulation.   Nevertheless,  it  is  hoped  that  our  simple 
model  (2)  has  provided  some  insights  into  the  optimization  of  this  combat 
process.   Thus,  it  is  the  form  of  the  optimal  policy  and  its  functional 
dependence  on  model  parameters  that  is  of  primary  interest. 

In  this  paper  we  have  considered  a  version  of  the  two-on-one  fire 
distribution  problem  considered  elsewhere  [14],  [18].   In  the  version  (2) 
considered  here  there  are  limits  on  the  rate  at  which  the  distribution  of 
fire  can  change.   This  might  be  thought  of  as  reflecting  a  command  and 
control  limitation  (e.g.  the  existence  of  a  time  lag  between  the  giving 
of  an  order  and  its  implementation).   From  our  above  study,  we  conclude 
that  the  structure  of  the  optimal  fire  distribution  policy  for  problem  (2) 
(a)  depends  upon  the  model  for  the  attrition  of  X-force  target  types  and 
not  upon  the  nature  of  the  model  for  redistribution  of  fire  (see  Section  9) 
(as  long  as  this  does  not  change  the  functional  form  of  enemy  target-type 
attrition);  (b)  depends  upon  the  following  model  parameters  (see  [16]  for 
further  discussion):   (1)   a.b.   for   i  =  1,2,   (2)   a  p  and  a„q,   and   (3) 
whether  Y  wins  or  loses;  and  (c)  is  very  similar  to  that  for  the  inertia- 
less  combat  case. 

To  elaborate  further  about  (c) ,  the  reader  can  find  results  for  the 
inertialess  combat  problem  (1)  reported  in  [14],  [18].   When  these  are 
compared  with  those  for  the  problem  at  hand  (2) ,  there  are  seen  to  be  many 
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similarities  (see  below) .   Both  models  have  the  same  attrition  structure 
(for  X-force  target  types) ,  although  the  model  (2)  considered  here  incor- 
porates command  and  control  limitations.   As  long  as  these  do  not  affect 
the  function  form  of  the  attrition  process  of  enemy  target  types,   the 
optimal  policies  are  similar  (see  also  Section  9) . 

We  saw  that  for  both  "inertialess"  and  "inertial"  combat,  we  must 
necessarily  have 

albl  *  a2b2> 

in  order  for  it  to  be  optimal  to  drive  x   to  zero  (while  x  >  0  and 
before   t  =  T) .   Furthermore,  we  also  developed  a  necessary  condition 
involving  v(t)  =  a  (-p  (t))  -  a  (-p  (t))  for  it  to  be  optimal  in  (2)  to 
have  <j>(t)  =  0   for   1   for  a  finite  interval  of  time.   Again,  the  results 
were  similar  to  those  for  the  "inertialess"  combat  problem  (1) . 

The  sign  of  the  quantity  v(t)  =  a  (-p  (t))  -  a_(-p„(t))   reflects 
the  ranking  of  target  priorities:   when  v(t)  >  0,   X   is  a  higher 
priority  target  for  Y   than  is  X_ ;  and  the  situation  is  reversed  when 
v(t)  <  0.   A  significant  result  (see  Section  5)  was  that  for  "inertial" 
combat  an  optimal  policy  for  the  distribution  of  Y-fire  over  enemy  target 
types  is  characterized  by  beginning  to  change  the  distribution  of  fire 
(i.e.  shifting  of  fires)  before  target  priorities  (as  measured  by  the  sign 


* 
In  [13]  Schreiber  formulates  a  Lanchester-type  combat  model  in  which  the 

effectiveness  of  intelligence  and  command  and  control  systems  modifies  the 

form  of  combatant  attrition.   These  capabilities  are  incorporated  into 

Schreiber' s  model  through  a  parameter   6  [0,1]   which  he  denotes  as  "command 

efficiency."   His  equations  reduce  to  Lanchester's  classical  equations  for 

area  fire  when  "command  efficiency"  is  equal  to  zero  for  each  side  and  to 

those  for  aimed  fire  when  it  is  equal  to  one. 
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of  v(t))  change.   (It  should  be  recalled  that  in  the  "inertlaless"  version 
of  this  problem  all  fire  was  concentrated  on  the  target  type  with  the 
higher  priority  and  instantaneously  shifted  when  this  changed.)   In  other 
words,  due  to  decreased  reaction  ability  one  begins  to  change  the  distri- 
bution of  fire  before  target  priorities  change  in  anticipation  of  this  coming 
change.   We  might  even  generalize  this  result  to  say  that  with  slow  reactions 
one's  optimal  policy  involves  anticipating  enemy  actions. 

It  should  finally  be  pointed  out  that  to  the  best  of  the  author's 
knowledge  the  problem  (2)  considered  in  this  paper  is  the  first  one  with 
a  higher  order  (i.e.  greater  than  first  order)  SVIC  (see  [3],  [6],  [22]) 
to  be  considered  in  the  operations  research  literature.   Moreover,  the 
complete  treatment  of  this  problem  was  made  possible  by  some  recent  results 
of  the  author  [21],  [22]. 


* 
Examples  of  problems  with  an  SVIC  of  order  greater  than  one  have  appeared, 

of  course,  previously  in  the  engineering  literature  (see  [3],  [6]). 
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